Wonjong Rhee

Orthogonal Negative Guidance in Attention Feature Space for Text-to-Image Generation

May 28, 2026

When Confidence Misleads: Suffix Anchoring and Anchor-Proximity Confidence Modulation for Diffusion Language Models

May 27, 2026

Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation

Apr 07, 2026

DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation

Oct 16, 2025

Soft Injection of Task Embeddings Outperforms Prompt-Based In-Context Learning

Jul 28, 2025

ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation

Jul 02, 2025

Task-Specific Preconditioner for Cross-Domain Few-Shot Learning

Dec 20, 2024

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

Dec 03, 2024

A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured Datasets

Oct 14, 2024

Towards a Better Evaluation of Out-of-Domain Generalization

Jun 02, 2024